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GENE TARGETING IN ANIMAL CELLS 
5 USING ISOGENIC DNA CONSTRUCTS 

FIELD OF THE INVENTION 
The present, invention relates generally to methods 
for modifying the genome of animal cells, including human 
10 cells, and more particularly, to methods for modifying a 
genomic DNA sequence by homologous recombination using 
substantially isogenic DNA constructs. 

BACKGROUND OF THE INVENTION 

15 Targeted gene disruption by homologous recombination 

has met with variable success in higher eukaryotes. While it 
has been possible to isolate cells which have stably 
incorporated exogenously prepared DNA sequences, in the vast 
majority of these cells, the DNA has integrated randomly into 

2 0 the genome rather than at the desired target site via 
homologous recombination. The ratio of the number of 
homologous recombinants to the total number of integration 
events varies, but typically, when there is no direct selection 
or enrichment for homologous recombinants, less than 1% of the 

2 5 integration events result from homologous recombination and 

ratios as low as 1 in 40,000 have been observed. Variations in 
the relative targeting efficiency have not bieen clearly 
attributable to differences in the length of homologous 
sequence present in the targeting conistructs. "Nor lias any 

3 0 unequivocal correlation been documented between recombination 

efficiency and transcriptional activity of the target gene or 
chromosomal location of the target gene. 

If the homologous recombinants can only be obtained 
amidst a large background of random integration events, then it 

35 may be impractical, if not impossible, to effectively target 
many genomic sequences . The approaches taken to overcoming 
this problem have focused on developing special strategies to 
screen or select homologous recombinants from the large 
background of non-homologous or random integration events In 

40 a few situations in which the targeted gene is itself a 



dominant selectable marker, it may be feasible to select 
directly for homologous recombinants.. For example, knocking 
out the hprt gene (encoding hypoxanthine phosphoribosyl 
transferase) results in increased tolerance of the base analog 
6-thioguanine (Thomas, K. and M. Capecchi, Cell 51:503-512 
(1987) . However, such particularized methods are not widely 
applicable. Other selection procedures aim at the enrichment 
for the desired homologous recombination event by suppressing 
colony formation due to random integrations of the targeting 
construct. In single selection protocols, the targeting 
constructs contain a marker gene, typically conferring drug 
resistance, deprived of transcriptional and/ or translational 
start signals, in such a way that the juxtaposition of the 
marker gene and functional expression signals would be obtained 
on homologous recombination but only rarely on random 
integration. Sedivy, J., and P. Sharp, Proc. N at '1 Acad. Sci. 
USA 86:227-231 (1989). The double or "positive/negative" 
selection procedure developed by Capecchi and co-workers makes 
use of an autonomously expressed marker gene, but the targeting 
construct is; flanked by a second gene which is detrimental to 
the cell and which tends to be lost on homologous recombination 
but not on random integration. Mansour, S. , at al., Nature 
336:348-352 (1988) . 

Another approach has involved the use of screening 
procedures based on the polymerase chain reaction ("PCR") , in 
which pools of cells are tested for potential homologous 
recombinants using pairs of primers which will be juxtaposed 
only if homologous recombination has occurred. Any pools 
containing potential homologous recombinants are then sub- 
divided and the procedure is continued until a small enough 
pool of cells can be analyzed individually. Zimmer, A., et 
al., Nature 338:150-153 (1989); and Joyner, A., et al., Nature 
338:153-156 (1989). Besides the labor involved in screening, 
the PCR protocols also require that appropriate regions of the 
DNAs have been sequenced and that oligonucleotide primers be 
obtained. 

The relative inefficiency of homologous recombination 
is even more problematic when working with c lis that are not 
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easily reproduced in vitro and for which the aforementioned 
selection and screening protocols may be impractical;' if not 
impossible. For example, there are a large variety of cell 
types, including many stem cell types, which are difficult or 

5 impossible to clonally reproduce in vitro. If the relative 

frequency of homologous recombination itself could be improved, 
then it might be feasible to target a variety of cells which 
are not amenable to specialized isolation techniques such as 
positive/negative selection or PCR screening. (See, 
10 W091/01140, which is incorporated herein by reference.) 

Thus, there remains a significant need for gene 
targeting systems in which homologous recombinants can be 
routinely and efficiently obtained at a high enough frequency 
to obviate the necessity of special selection and screening 

15 protocols. The present invention fulfills these and other 
needs . 

SUMMARY OF THE INVENTION 
The present invention provides novel methods for 
20 modifying the genome of an animal cell comprising the steps of: 
constructing a DNA molecule in which desired sequence 
modifications are contained in a segment of DNA (a "targeting 
DNA") that is substantially isogenic with a DNA in the cell 
genome (a "target DNA") ; introducing the targeting DNA 

2 5 construct into the cell (e.g., by microinjection, 

electroporation, transf ectidn, or calcium phosphate 
precipitation) ; and selecting cells in which the desired 
sequence modifications have been introduced into the genome via 
homologous recombination. 

30 Preferably, the targeting DNA will be derived from a 

cell line that is closely related to the cell line which is 
being targeted; so that the sequence of the targeting DNA is 
substantially identical with the sequence of the target DNA 
(except for the desired sequence modifications). By using 

35 substantially isogenic targeting DNA, a substantial fraction of 
the cells in which integration has occurred will have undergone 
homologous recombination between the targeting DNA sequence and 
the target DNA sequence. Since the integration events are 



thereby enriched for homologous recombinants, it is possible to 
forego the use of special selection and screening protocols 
used to isolate rare homologous recombinants from a large 
background of non-homologous integration m events. 

Although the present invention has been applied to 
laboratory mice strains such as BALB/c and 129, the invention 
will be even more useful for gene targeting in non-murine 
animals. The typical mouse strains used in laboratories tend 
to be fairly inbred and, as a result, there is smaller 
likelihood of sequence divergence in an allele derived from 
different lines (see, e.g., Bishop, C. , et al. , Nature 315:70- 
72 (1985)). In contrast , many other animals are not so inbred, 
and there is a greater chance of sequence divergence between 
alleles derived from different individuals. The restriction 
fragment length polymorphisms ( "RFLPs" ) , useful in 
"fingerprinting" human DNA, are an example of this phenomenon 
in a non-inbred species. 

A preferred cell type for targeting the genome of a 
mammalian organism is the embryonic stem cell. Preferably, the 
DNA construct contains an antibiotic resistance marker and the 
cells are first selected on a medium containing the antibiotic. 

The present invention also provides novel methods for 
creating genetically modified animals comprising the steps of: 
modifying the genome of embryonic stem cells derived from the 
animal, as described above; introducing the modified embryonic 
stem cells into blastocysts derived from the same species of 
animal; and using a pseudo-pregnant female to carry the 
chimeric animal to term. The resulting chimeric animal can in 
turn be bred to obtain non-chimeric animals in which the 
desired genetic alteration has been stably inherited through 
germ-line transmission. 

The present invention can also be used for the direct 
targeting of animal zygotes. The targeting DNA can be 
introduced by, for example, microinjection, and then, with 
mammals for example, the modified zygotes can be transferred to 
pseudo-pregnant females capable of carrying the animal to term. 
Similarly, for somatic gene therapy, the genome of somatic 
cells of an animal is directly modified using the substantially 



isogenic targeting DNA and then the modified cells are 
introduced into the same or a different animal . 

In another aspect, the present invention provides 
cells exhibiting a recombination event at a preselected native 
target DNA site in the cell genome. Thus, in view of the 
increased efficiency of recombination utilizing the methods of 
the present invention, a collection of cells having undergone a 
recombination event will exhibit between about 10-90%, 
typically at least about 30 to 50%, recombination. The cells 
exhibiting the desired characteristics may be selected for and 
isolated in accordance with standard techniques, and grown into 
animals. 

BRIEF DESCRIPTION OF FIGURES 
Fig. 1. DNA targeting constructs. (a) The 
retinoblastoma (Rb) locus around exons 19 and 20 (black boxes) , 
in mouse strain 129. Restriction enzyme sites are as follows: 
B=BglII, E=EcoRI, H=HindIII, Hp=HpaI , P=PstI , and S=StuI. 
Except for the StuI site, these sites are also present in the 
corresponding BALB/c region. (b) The DNA targeting constructs 
containing the neo gene inserted into the Bglll site of exon 19 
within a 10.5 kb Rb Hpal fragment derived from mouse strains 
129 (targeting construct "129Rb-neo") or BALB/c (targeting 
construct "B/cRb-neo") . The neo marker was flanked by 2.5 and 
8 kb of Rb sequence. (c) and (d) . Two additional isogenic 
targeting constructs generated by inserting the hErt-minigene 
(Van der Lugt, N. , et al., Gene . (1991)) or the hyg gene (Te 
Riele, H., et al., Nature 348:649-651 (1990)) into the Bglll 
site of exon 19 within a 17 kb 129-derived Rb fragment, giving 
129 Rb - hprt (c) and 129Rb- hvcr (d) , respectively. These two 
constructs were flanked by non-endogenous Sail sites. A and B 
indicate fragments used as probes to detect modifications at 
Rb. 

Fig. 2. Sequence divergence between BALB/c and 129 
DNA at the region of homology. The upper part of the diagram 
represents the 10.5 kb. Rb sequence present in targeting 
constructs B/cRb-neo and 129Rb- neo (see Fig. lb) . The sequence 
was divided into nine smaller fragments, as shown by the solid 



vertical lines. Filled triangles represent extra nucleotides 
within a fragment in the BALB/c-derived sequence (above the 
line) or within the 129-derived sequence (below the line) . 
Open triangles indicate length differences within a fragment 
that could result from nucleotide insertions or restriction 
site polymorphisms. The lower part of the diagram shows 
nucleotide differences as determined by sequence analysis of 
the indicated regions. 

DETAILED DESCRIPTION OF THE INVENTION 
In accordance with the present invention, gene 
targeting can be used to modify the genome of animal cells, 
including human cells, using an efficient technique involving 
homologous recombination between substantially isogenic DNA 
constructs. By introducing an exogenous "targeting DNA" into 
eukaryotic cells, selecting for cells in which the targeting 
DNA has been stably integrated into the recipient cell genome 
is readily accomplished. The methods provided for 
substantially increased frequency of recombination, one to 
three orders of magnitude higher, or more are pending upon the 
target and protocol. 

There are two general events believed to be 
responsible for stable integration. In homologous 
recombination, the incoming DNA interacts with and integrates 
into a site in the genome that contains a substantially 
homologous DNA sequence. In non-homologous ("random" or 
"illicit") integration, the incoming DNA is not found at a 
homologous sequence in the genome but integrates elsewhere, 
apparently at one of a large number of potential locations. In 
general, studies with higher eukaryotic cells have revealed 
that the frequency of homologous recombination is far less than 
the frequency of random integration. The ratio of these 
frequencies has direct implications for "gene targeting" which 
depends on integration via homologous recombination (i.e. 
recombination between the exogenous "targeting DNA" and the 
corresponding "target DNA" in the genome) . 

Gene targeting represents a major advance in the 
ability to selectively* manipulate animal cell genomes. Using 
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this technique, a particular DNA sequence can be targeted and 
modified in a site-specific and precise manner. Different 
types of DNA sequences can be targeted for modification, 
including regulatory regions, coding regions and regions of DNA. 
5 between genes. Examples of regulatory regions include: 

promoter regions, enhancer regions, terminator regions and 
introns. By modifying these regulatory regions, the timing and 
level of expression of a gene can be altered. Coding regions 
can be modified to alter, enhance or eliminate, for example, 

10 the specificity of an antigen or antibody, the activity of an 
enzyme, the composition of a food protein, the sensitivity of 
protein to inactivation, the secretion of a protein, or the 
routing of a protein within a cell. Introns and exons, as well 
as inter-genic regions, are suitable targets for modification. 

15 Modifications of DNA sequences can be of several 

types, including insertions, deletions, substitutions, or any 
combination of the preceding. A specific example of a 
modification is the inactivation of a gene by site-specific 
integration of a nucleotide sequence that disrupts expression 

2 0 of the gene product. Using such a technique to "knock out" a 

gene by targeting will avoid problems associated with the use 
of antisense RNA to disrupt functional expression of a gene 
product. For example, one approach to disrupting a target gene 
using the present invention would be to insert a selectable 
25 marker into the targeting DNA such that homologous 

recombination between the targeting DNA and the target DNA will 
result in insertion of the selectable marker into the coding 
region of the target gene. 

It may be preferable to incorporate a selectable 

3 0 marker into the targeting DNA which allows for selection of 

targeted cells that have stably incorporated the targeting DNA. 
This is especially useful when employing relatively low 
efficiency transformation techniques such as electroporation, 
calcium phosphate precipitation and liposome fusion, as 
35 discussed below, where typically fewer than 1 in 1000 cells 
will have stably incorporated the exogenous DNA. Using high 
efficiency methods, such as microinjection into nuclei, 
typically from 5-25% of the cells will have incorporated the 
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targeting DNA; and it is therefore feasible to screen the 
targeted cells directly without the necessity of first 
selecting for stable integration of a selectable marker. 

Examples of selectable markers include: genes 
conferring resistance to compounds such as antibiotics, genes 
conferring the ability to grow on selected substrates, genes 
encoding proteins that produce detectable signals such as 
luminescence. A wide variety of such markers are known and 
available, including, for example, antibiotic resistance genes 
such as the neomycin resistance gene (neo) , Southern, P. , and 
P. Berg, J. Mol. AppI. Genet. 1:327-341 (1982); and the 
hygromycin resistance gene (hyg) , Nucleic Acids Research 
11:6895-6911 (1983), and Te Riele, H. , et al. , Nature 348:649- 
651 (1990) . Selectable markers also include genes conferring 
the ability to grow on certain media substrates such as the tk 
gene (thymidine kinase) or the hprt gene (hypoxanthine 
phosphoribosyltransferase) which confer the ability to grow on 
HAT medium (hypoxanthine, aminopterin and thymidine) ; and the 
bacterial apt gene (guanine/ xanthine phosphoribosyltransferase) 
which allows growth on MAX medium (mycophenolic acid, adenine, 
and xanthine). See Song, K-Y. , et al. Proc. Nat'l Acad. Sci. 
USA 84:6820-6824 (1987). Other selectable markers for use in 
mammalian cells, and plasmids carrying a variety of selectable 
markers, are described in Sambrook, J., et al. , Molecular 
Cloning - A Laboratory Manual , Cold Spring Harbor Laboratory/ 
Cold Spring Harbor, New York (1989) (hereinafter "Sambrook"), 
see chapter 16. 

If a selectable marker is used, the preferred 
location of the marker gene in the targeting construct will 
depend on the aim of the gene targeting. For example, if the 
aim is to disrupt target gene expression, then the selectable 
marker can be cloned into targeting DNA corresponding to coding 
sequence in the target DNA. Alternatively, if the aim is to 
express an altered product from the target gene, such as a 
protein with an amino acid substitution, then the coding 
sequence can be modified to code for the substitution, and the 
selectable marker can be placed outside of the coding region, 
in a nearby intron for example. 



If the selectable markers will depend on their own 
promoters for expression and the marker gene is derived from a 
very different organism than the organism being targeted (e.g. 
prokaryotic marker genes used in targeting mammalian cells) , it 
is preferable to replace the original promoter with 
transcriptional machinery known to function in the recipient 
cells. A large number of transcriptional initiation regions 
are available for such purposes including, for example, 
metallothionein promoters, thymidine kinase promoters, beta- 
act in promoters, immunoglobulin promoters, SV4 0 promoters and 
human cytomegalovirus promoters. A widely used example is the 
pSV2-neo plasmid which has the bacterial neomycin 
phosphotransferase gene under control of the SV40 early 
promoter and confers in mammalian cells resistance to G418 (an 
antibiotic related to neomycin). Southern, P., and P. Berg, 
Mo 1 . App 1 . Genet . 1:327-341 (1982). A number of other 
variations may be employed to enhance expression of the 
selectable markers in animal cells, such as the addition of a 
poly (A) sequence ( see , e.g., Thomas, K. , et al., Cell 44:419- 
428 (1986)); and the addition of synthetic translation 
initiation sequences ( see , e.g., Thomas, K. and M. Capecchi, 
Cell 51:503-512 (1987)). Both constitutive and inducible 
promoters may be used. 

In some cases, it may be desirable for the 
modification sequences (including selectable markers) to alter 
the transcriptional activity of the target gene. However, if 
selectable markers are used and it is not desirable to affect 
transcriptional activity of the target gene, it will be 
preferable to use selectable markers with an inducible promoter 
and/or to include a transcription termination sequence 
downstream of the selectable marker. A variety of inducible 
promoters and transcription termination sequences are known and 
available. See, e.g., Sambrook, supra. 

Where the target gene is highly expressed or readily 
inducible, it may be advantageous to use selectable markers 
lacking their own promoters as a way to further enhance the 
frequency of obtaining homologous recombinants. In that way, 
the likelihood of the selectable marker being highly expressed 
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upon integration into the genome will be much greater for 
homologous recombination events (where the promoterless gene 
will have been placed in the vicinity of the target gene 
promoter) than for random integration into the genome. 
5 Target genes can also be modified by deletions. In 

the case of a deletion , the sequence to be deleted will be 
absent or removed from the corresponding targeting DNA and thus 
the "modification sequence" will constitute a missing sequence 
relative to the target DNA. The deletion will generally cover 
10 a portion of one or more exons and may include introns and 

flanking non-coding regions such as regulatory regions. The 
deletion may be as small as one base pair or as large as tens 
of thousands of base pairs. 

Another specific form of modification is the 
15 introduction of a new gene into the animal cell genome. By 
flanking the new gene with sequences substantially isogenic 
with target DNA in the host cell, it is possible to introduce 
the gene in a site-specific fashion at the targeted location. 
Using this approach, a gene from any source (e.g., bacterial, 
2 0 plant, animal) can be introduced into an animal cell to impart 
new characteristics to the cell or to allow the animal cell to 
produce desired polypeptides which can then be isolated from 
the animal or from its cells in vitro^ 

Another form of modification is the insertion of a 
25 marker gene in a region outside of but proximal to a gene of 

interest. This sort of modification results in the creation of 
a new linkage in the animal genome. For this approach, the 
precise function of a target sequence need not be known, so 
long as it is known to be associated with a particular trait. 
30 Selectable markers can be introduced into precise locations 
adjacent to desirable genes to facilitate selection of 
desirable traits that are otherwise not selectable in culture. 
This procedure is of value, for instance, in order to 
facilitate animal breeding programs. Segregation of the trait 
35 through successive generations can be tracked by growing cells 
on the appropriate selective medium. Thus, the time required 
to breed improved varieties can be shortened. As an example of 
this kind of approach, regions identified by RFLP analysis to 
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be associated with complex traits can be targeted and cells 
containing the traits can be selected in culture. 

The targeting DNA comprises a sequence in which the 
desired sequence modifications are flanked by DNA substantially 
isogenic with a corresponding target sequence in the genome to 
be modified. The substantially isogenic sequence is preferably 
at least about 97-98% identical with the corresponding target 
sequence (except for the desired sequence modifications) , more 
preferably more preferably at least about 99.0-99.5% identical, 
most preferably about 99.6 to 99.9% identical. Particularly 
for non-inbred animals (e.g., other than mice strains 129 and 
BALB/c) , the sequencers are typically 100% identical. The 
targeting DNA and the target DNA preferably share stretches of 
DNA at least about 75 base pairs that are perfectly identical, 
more preferably at least about 150 base pairs that are 
perfectly identical, even more preferably at least about 500 
base pairs that are perfectly identical. Accordingly, it is 
preferable to use targeting DNA derived from cells as closely 
related as possible to the cell line being targeted; more 
preferably, the targeting DNA is derived from cells of the same 
cell line as the cells being targeted. Most preferably, the 
targeting DNA is derived from cells of the same individual (or 
animal) as the cells being targeted. 

Preferably, the targeting DNA sequence is at least 
about 100-2 00 bp of substantially isogenic DNA, more preferably 
at least about 300-1000 bp and generally less than about 15,000 
bp. The amount of targeting DNA present on either side of a 
sequence modification can be manipulated to favor either single 
or double crossover events, both of which can be obtained using 
the present invention. In a double crossover or "replacement- 
type" event, the portion of the targeting DNA between the two 
crossovers will replace the corresponding portion of the target 
DNA. In a single crossover or "insert ion- type" event, the 
entire targeting DNA will generally be incorporated into the 
target sequence at the site of the single crossover. To 
promote double crossovers, the modification sequences are 
preferably flanked by targeting DNA such that, upon 
linearization, the modification. sequences are located towards 
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the middle of the flanking targeting DNA. If single crossovers 
are desired , the targeting DNA should be designed such that the 
ends of the linearized targeting sequence correspond to target 
DNA sequences lying adjacent to each other in the genome, as 
5 described by Thomas, K. , and M. Capecchi, Cell 51:503-512 
(1987) . 

The DNA delivery molecule may contain only the 
targeting DNA with modification sequences or it may contain 
additional DNA flanking the targeting DNA, If this additional 

10 DNA contains a selectable marker, then it may be possible to 

further enrich for cells which have undergone double crossover 
homologous recombination because these cells will generally 
have lost the flanking selectable marker located outside the 
targeting DNA. Conversely, cells which have stably 

15 incorporated the flanking selectable marker are likely to have 
arisen by random integration of the DNA construct into the 
genome. One such flanking selectable marker is the HSV-tk gene 
which confers sensitivity to the antibiotic gancyclovir. 
Mansour, S., et al. , Nature 336:348-352 (1988). 

2 0 Combinations of selectable markers can also be used 

to advantage. For example, to target non-selectable gene "X," 
a neo gene (with or without its own promoter, as discussed 
above) can be cloned into a DNA sequence which is substantially 
isogenic with gene X. As discussed above, the placement of 

2 5 this marker gene, particularly whether it is in an exon or 

outside the coding sequence, will depend on the aim of the gene 
targeting. To use a combination of markers, the HSV-tk gene 
can be cloned such that it is outside of the targeting DNA 
(another selectable marker could be placed on the opposite 

3 0 flank, if desired) . After introducing the DNA construct into 

the cells to be targeted, the cells can be selected on the 
appropriate antibiotics. In this particular example, those 
cells which are resistant to G418 and gancyclovir are most 
likely to have arisen by homologous recombination in which the 
35 neo gene has been recombined into gene X but the tk gene has 
been lost because it was located outside the region of the 
double crossover. As discussed above, it will be necessary to 
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ensure that the selectable markers are adequately expressed in 
the recipient cells. 

The targeting DNA construct may also contain 
replication systems which are functional in prokaryotes, 
5 especially coli , which were of use in constructing the DNA 
molecule, and for performing and analyzing genetic 
manipulations of the targeting sequence. Preferably, however, 
DNA sequence not required for the gene targeting is removed 
prior to introducing the DNA into cells to be targeted. 

10 The DNA delivery molecule containing the targeting 

DNA may also contain DNA sequences or proteins that affect the 
uptake of the DNA delivery molecule or the fate of the molecule 
after introduction into the cells. For example, the DNA 
delivery molecule may be a viral capsid containing the 

15 targeting DNA, as discussed below. Also, the DNA delivery 

molecule may contain sequences or DNA binding proteins that 
affect degradation or localization of the molecule following 
entry into the targeted cells, or that affect the catalysis of 
homologous recombination . 

2 0 Transformation of animal cells with the recombinant 

construct containing the targeting DNA can be carried out using 
essentially any method for introducing nucleotide sequences 
into animal cells including, as discussed below, 
microinjection, electroporation, calcium phosphate 

25 precipitation, and transfection using a virus or viral 
particle. 

After the targeting DNA has been introduced into the 
animal cells, the cells in which the targeting DNA has stably 
integrated into the genome can be selected. The choice of 

30 which one to use will generally depend upon the nature of the 
sequence that has been integrated. For example, if the 
targeting DNA contains a selectable marker, as described above, 
then the integration of targeting DNA into the genome results 
in the stable acquisition of the selectable marker. In some 

35 situations the cells may be selected by virtue of a 

modification of the target gene. For example, if the target 
gene has a selectable phenotype, then modification of the 
target DNA may result in loss or alteration of that phenotype. 
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In other situations, a selectable phenotype may result from 
juxtaposition of a DNA sequence present on the targeting DNA 
with DNA sequences present near the target DNA. For example r 
integration of a promoterless antibiotic resistance gene at the 
target site may result in expression of the resistance gene 
based on transcriptional activity at the target site. 

It is also possible, although not essential, to use 
the polymerase chain reaction (PCR) to screen cells in which 
homologous integration has occurred. In an advantageous 
application, one PCR primer is directed to DNA in the 
modification sequence and another primer is directed to DNA 
near the target locus that is outside but proximal to the 
target DNA, such that integration results in the creation of a 
genomic DNA sequence in which the primer binding sites are 
facing each other in relative juxtaposition. After a number of 
rounds of amplification, DNA from such a locus will be present 
at much higher levels because it is being amplified 
exponentially rather than linearly. 

Homologous recombination can be confirmed using 
standard DNA hybridization techniques, such as Southern 
blotting, to verify the presence of the integrated DNA in the 
desired genomic location. 

Where the cells contain more than one copy of a gene, 
the cell lines obtained from the first round of targeting are 
likely to be heterozygous for the targeted allele. 
Homozygosity, in which both alleles are modified, can be 
achieved in a number of ways. One approach is to grow up a 
number of cells in which one copy has been modified and then to 
subject these cells to another round of targeting using a 
different selectable marker. Alternatively, homozygotes can be 
obtained by breeding animals heterozygous for the modified 
allele, according to traditional Mendel ian genetics. In some 
situations, it may be desirable to have two different modified 
alleles. This can be achieved by successive rounds of gene 
targeting or by breeding heterozygotes, each of which carries 
one of the desired modified alleles. 

The present invention can be used with a variety of 
cell types derived from a number of animal sources. As 
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discussed above, the invention is especially useful with 
animals, such as non-murine animals, in which inbreeding is not 
very common. The choice of particular cell types for targeting 
will generally depend on the purposes for which the site- 
5 directed mutagenesis is undertaken. For example, if whole 
animals carrying a particular mutation are desired, then 
embryonic stem cells derived from that animal can be targeted 
and later introduced into blastocysts for growing the modified 
cells into chimeric animals. For embryonic stem cells, either 

10 an embryonic stem cell line or freshly obtained stem cells may 
be used. The resulting chimeric animals can be bred in order 
to obtain non-chimeric animals in which the mutation has been 
transmitted through the germ line. 

Another approach to creating genetically altered 

15 animals that can be used with the present invention is to 

modify zygotes directly. For mammals, the modified zygotes can 
be then introduced into the uterus of a pseudopregnant female 
capable of carrying the animal to term. 

Besides altering organisms through germ-line 

2 0 modifications, gene targeting can also be used to modify 

somatic cells. Cells of interest for somatic gene targeting 
include hematopoietic cells, T-lymphocytes and other cells of 
the immune system, epithelial cells, endothelial cells, adrenal 
medulla cells, keratinocytes , fibroblasts, osteoblasts, 
25 osteoclasts, neurons, ganglion cells, retinal cells, liver 

cells, myoblast cells, and cells of the islets of Langerhans. 
Also of interest will be the stem cells which serve as the 
progenitors of the above cells and which may be an original 
progenitor cell or a progenitor cell that has already become 

3 0 dedicated to a particular cell lineage. 

In addition to applications such as the production of 
transgenic animals and gene-therapy, the techniques of the 
present invention are also useful in expanding basic knowledge 
with respect to animal cell function. For example, the 
35 .expression of altered forms of genes and their promoters can be 
analyzed without position effects because the gene is altered 
in situ ; and the function of sequences whose purpose is unknown 
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can be determined by inactivating the sequence and observing 
changes in c 11 function. 

The following list of terms, intended to supplement 
the descriptions above, will be useful in understanding the 
present invention: 

Target DNA sequence 

The DNA to be modified by homologous recombination. 
The target DNA can be in any organelle of the animal cell 
including the nucleus, and mitochondria and can be an intact 
gene, an exon or intron, a regulatory sequence or any region 
between genes. 

Desired sequence modifications 

Sequence changes that it would be desirable to 
introduce Into the target DNA. These sequence modifications 
may include insertions, deletions or substitutions of DNA 
sequence, or any combination thereof, and may be as small as a 
single base pair or as large as tens of thousands of base 
pairs. Insertions include the insertion of entire genes which 
may be of animal, plant, prokaryotic or viral origin. 

Targeting DNA sequence 

A DNA sequence containing the desired sequence 
modifications and which is, except for the sequence 
modifications, substantially isogenic with the target DNA. 

DNA delivery molecule 

The molecule comprising at least the targeting DNA 
which is introduced into cells to be targeted. 

Uninterrupted sequence identity 

The length of a stretch of DNA sequence that is 
identically conserved between two homologous DNA sequences. 

Average sequence identity 

The percentage of DNA sequence that is identically 
conserved between two homologous DNA sequences. For example, 



if a first DNA sequence comprises 200 base pairs and a second 
sequence differs at two sites (e.g. a small insertion such as 4 
nucleotides , and a small deletion) , then the average sequence 
identity is about 99%. 

Homologous DNA sequence or homologous DNA 

DNA sequence that is at least about 70% identical 
with a reference DNA sequence. An indication that two 
sequences are homologous is that they will hybridize with each 
other under fairly stringent conditions (see, e.g., Maniatis or 
Sambrook, infra) . 

Isogenic or substantially isogenic DNA 

DNA sequence that is identical with or nearly 
15 identical with a reference DNA sequence. Indications that two 
sequences are isogenic is that they will hybridize with each 
other even under the most stringent hybridization conditions 
(see, e.g. Maniatis or Sambrook, infra) ; and will not exhibit 
sequence polymorphisms (i.e. they will not have different sites 

2 0 for cleavage by restriction endonucleases) . The term 

"substantially isogenic" refers to DNA that is at least about 
97-99% identical with the reference DNA sequence, and 
preferably at least about 99.5-99.9% identical with the 
reference DNA sequence, and in certain uses 100% identical with 

25 the reference DNA sequence. Indications that two sequences are 
substantially isogenic is that they will still hybridize with 
each other under the most stringent conditions (see, Sambrook) 
and they will only rarely exhibit RFLPs or sequence 
polymorphisms (relative to the number that would be 

30 statistically expected for sequences of their particular length 
which share at least about 97-99% sequence identity) . 

Homologous recombination 

The term "homologous recombination" refers to the 

3 5 process of DNA recombination based on sequence homology. The 

term embraces both crossing over and gene conversion. Cellular 
recombination enzymes are believed to be involved in the 
process of recognizing sequence identity between distinct 
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nucleotide sequences. Three distinct types of homologous 
recombination have been distinguished based on the nature of 
the recombination substrates and the mechanisms believed to be 
involved in mediating recombination: "chromosomal 
recombination," "extrachromosomal recombination" and "gene 
targeting" (see definitions, infra) . 

Chromosomal recombination 

Homologous recombination between two DNA sequences 
within a single chromosome ( "intrachromosomal recombination") 
or recombination between chromosomes ("interchromosomal 
recombination") . A common example of interchromosomal 
recombination is the mitotic recombination between homologous 
chromosomes . 

Extrachromosomal recombination 

Homologous recombination between two DNA sequences 
neither of which are located on chromosomes. An example of 
extrachromosomal recombination is the recombination between two 
viruses transfected into a single recipient cell. 

Gene targeting 

Homologous recombination between two DNA sequences, 
one of which is located on a chromosome and the other of which 
is not. 

Non-homologous or "random" integration 

Any process by which DNA is integrated into the 
genome that does not involve homologous recombination. It 
appears to be a random process in which incorporation can occur 
at any of a large number of genomic locations. 

Selectable marker 

A gene, the expression of which allows cells 
containing the gene to be identified on a particular medium. A 
selectable marker can be one that allows a cell to proliferate 
on a medium that prevents or slows the growth of cells without 
the gene. Examples include antibiotic resistance genes and 
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genes which allow an organism to grow on a selected metabolite. 
Alternatively, the gene can facilitate visual screening of 
transformants by conferring on cells a phenotype that is easily 
identified. Such an identifiable phenotype may be, for 
example, the production of luminescence or the production of a 
colored compound, or the production of a detectable change in 
the medium surrounding the cell. 

Animal cell 

A cell of a multicellular eukaryotic organism of, for 
example, the phyla chordata, echinodermata , coelenterata, 
annelida, and arthropoda. Preferably, the animal cells are 
from an animal belonging to the phylum chordata, more 
preferably the subphylum vertebrata. Most preferably, the 
animal cells are non-murine mammalian cells, including human 
cells. 

EXPERIMENTAL 

A . General Methods 

Generally, the nomenclature and standard laboratory 
procedures with respect to recombinant DNA technology can be 
found in Maniatis, T. et al. , . Molecular Cloning - A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New 
York (1982) (hereinafter "Maniatis") ; and now in Second Edition 
by Sambrook, J., et al. (1989) (hereinafter "Sambrook") . Other 
general references are provided throughout this document. The 
procedures therein are believed to be well known in the art, 
and are provided for the convenience of the reader. Maniatis, 
Sambrook, and the other general references are specifically 
incorporated herein by reference. 

B. Preparation of the Targeting DNA 
1) Isolation of Isogenic DNA 

The targeting DNA comprises a sequence in which the 
desired sequence modifications are flanked by DNA substantially 
isogenic with a corresponding target sequence in the genome to 
be modified. The targeting DNA can be constructed exclusively 
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from genomic DNA, from cDNA, from synthetic DNA or from any 
combination of the above. The genomic DNA can be cloned from a 
library of genomic DNA fragments in a bacteriophage vector 
(e.g., lambda phage), in a plasmid vector (e.g., pBR322 
derivative), or in a cosmid vector; using techniques well-known 
in the art of recombinant DNA. cDNA can, e.g., be prepared 
from a mRNA population which forms the basis of preparation of 
a cDNA library. Alternatively, synthetic DNA fragments can be 
prepared based upon knowledge of the nucleotide sequence of the 
target DNA. 

2) Modifying the Targeting DNA 

Modification of the targeting DNA will depend on two 
basic considerations: firstly, what modifications are desired 
in the target DNA; secondly, whether selectable sequences 
should be included as an aid in isolating homologous 
recombinants . 

In some situations, the modified targeting DNA will 
already be available. If for example, a mutant version of a 
particular gene is already available as a recombinant DNA 
construct, then the targeting DNA may be obtained from that 
source using standard cloning techniques. See , e.g., Sambrook. 
As discussed above, the efficiency of homologous recombination 
depends in part on the isogenicity of the targeting DNA and the 
target DNA. Where the modification is available in a different 
cell line than that being targeted, it may be preferable to 
clone the modification sequence out of the original DNA and 
into DNA sequence that is more nearly isogenic with the target 
DNA. In general, such cloning will be performed in prokaryotic 
organisms, using standard cloning techniques. Id. 

If a targeting DNA with desired sequence 
modifications is not already available, then a fragment of 
substantially isogenic targeting DNA can be obtained and 
modified. Generally, the isogenic targeting DNA will be most 
easily modified after being cloned onto vectors that can be 
used in prokaryotic organism such as E_j_ coli . If an 
appropriate fragment of isogenic targeting DNA is not already 
available, then a gene library of the cell line to be targeted 



can be prepared and screened for the desired sequence using 
techniques well known in the art. See, e.g., Sambrook. Once 
the targeting DNA is cloned, insertions, deletions and 
alterations of DNA sequences can be achieved by techniques well 
known in the art. See, Sambrook. If the sequence of the target 
DNA is known, it is also possible to obtain synthetic DNA 
fragments in which one or more of the base pairs are 
specifically altered, added or removed. 

One particular type of modification is the insertion 
of a gene within the targeting DNA. As discussed above, it 
will often be advantageous to include a selectable marker as an 
insert to facilitate selection of recombinants. A wide variety 
of such markers are generally known and available; and can be 
readily cloned into a desired targeting DNA construct using 
well known techniques. See, e.g., Sambrook. 

C. Construction of DNA Delivery M olecules 

Typically, cloning of the targeting DNA will be 
performed in coli, and standard plasmids such as pUC and 
pBR322 derivatives can be used. In many cases, it will be 
possible to use these plasmid DNAs directly as DNA delivery 
molecules; but it is preferable to minimize the amount of 
extraneous DNA on the delivery molecule. Thus, DNA sequence 
that will not be involved in targeting or selection of 
homologous recombinants can be removed prior to introduction of 
the DNA into the recipient cells. Preferably , the DNA will be 
linearized by cutting with a restriction enzyme prior to 
introduction into the cell. See, Sambrook. Where biological 
methods of DNA introduction are used, such as a virus or viral 
capsid, the DNA delivery molecule will be tailored accordingly 
to the particular system. For example, particular viral 
capsids generally work most efficiently with DNA sequences that 
are within a particular size range. 

As discussed above, the DNA delivery molecule 
containing the targeting DNA may also contain DNA sequences or 
proteins that affect the uptake of the molecule or the fate of 
the molecule after introduction into the cells. For example, 
the DNA delivery molecule may be a viral capsid containing the 
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targeting DNA, as discussed below. Also, the DNA delivery 
molecule may contain sequences or DNA binding proteins that 
affect degradation or localization of the molecule following 
entry into the targeted cells or molecules that affect the 
5 catalysis of homologous recombination. 

D. Cells to be Targeted 
The present invention can be used with essentially 

any cell into which DNA can be introduced. As discussed in the 
following section, there are a variety of methods applicable 
for introducing DNA into animal cells. The choice of cell type 
will depend on the particular goal of the site-directed 
mutagenesis. For example, embryonic stem cells or zygotes may 
be targeted for generating modified animals; whereas both germ- 
line and somatic cells may be usefully targeted for gene 
therapy. The choice of cells may also affect (or be affected 
by) the choice of transformation technique, as discussed below. 
Growth and manipulation of the cells can be performed using 
standard procedures as described in Hogan, B. , et al, 
Manipulating the Mouse Embryo , Cold Spring Harbor, New York 
(1986) . 

E. introduction of the DNA into the Cells 
Any technique that can be used to introduce DNA into 

25 the animal cells of choice can be employed. Electroporation 
has the advantage of ease and has been found to be broadly 
applicable, but a substantial fraction of the targeted cells 
may be killed during electroporation. Therefore, for sensitive 
cells or cells which are only obtainable in small numbers, 

30 microinjection directly into nuclei may be preferable. Also, 
where a high efficiency of DNA incorporation is especially 
important, such as targeting without the use of a selectable 
marker (as discussed above) , direct microinjection into nuclei 
is an advantageous method because typically 5-25% of targeted 

3 5 cells will have stably incorporated the microinjected DNA. 

Retroviral vectors are also highly efficient but in some cases 
they are subject to other shortcomings, as described by Ellis, 
J., and A. Bernstein, Molec. Cell. Biol. 9:1621-1627 (1989). 
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Where lower efficiency techniques are used, such as 
electroporation, calcium phosphate precipitation or liposome 
fusion, it is preferable to have a selectable marker in the 
targeting DNA so that stable transf ormants can be readily 
5 selected, as discussed above. A variety of such transformation 
techniques are well known in the art, including: 

(1) Direct microinjection into nuclei: 

Targeting constructs can be microinjected directly 
10 into animal cell nuclei using micropipettes to mechanically 

transfer the recombinant DNA. This method has the advantage of 
not exposing the DNA to cellular compartments other than the 
nucleus and of yielding stable recombinants at high frequency. 
See, Capecchi, M. , Cell 22:479-488 (1980). 

15 

( 2 ) Electroporation : 

The targeting DNA can also be introduced into the 
animal cells by electroporation. In this technique, animal 
cells are electroporated in the presence of DNA containing the 

20 targeting construct. Electrical impulses of high field 

strength reversibly permeabilize biomembranes allowing the 
introduction of the plasmids. The pores created during 
electroporation permit the uptake of macromolecules such as 
DNA. Procedures are described in, e.g., Potter, H. , et al., 

25 Proc. Nat'l. Acad. Sci. USA 81:7161-7165 (1984); and Sambrook, 
ch . 16 . 

(3) Calcium phosphate precipitation: 

The targeting constructs may also be transferred into 
3 0 cells by other methods of direct update, for example, using 
calcium phosphate. See, e.g., Graham, F. , and A. Van der Eb, 
Virology 52:456-467 (1973); and Sambrook, ch.16. 

(4) Liposomes: 

35 Encapsulation of DNA within artificial membrane 

vesicles (liposomes) followed by fusion of the liposomes with 
the target cell membrane can also be used to introduce DNA into 
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animal cells. See Mannino, R. and S. Gould-Fogerite, 
BioTechniaues . 6:682 (1988). 



(5) Viral capsids: 

5 Viruses and empty viral capsids can also be used to 

incorporate DNA and transfer the DNA to animal cells. For 
example, DNA can be incorporated into empty polyoma viral 
capsids and then delivered to polyoma-susceptible cells. See, 
e.g., Slilaty, S. and H. Aposhian, Science 220:725 (1983). 

10 

(6) Transfection using polybrene or DEAE-dextran: 

These techniques are described in Sambrook, ch.16. 

(7) Protoplast fusion: 

15 Protoplast fusion typically involves the fusion of 

bacterial protoplasts carrying high numbers of a plasmid of 
interest with cultured animal cells, usually mediated by 
treatment with polyethylene glycol. Rassoulzadegan , M. , et 
al. f Nature . 295:257 (1982). 

20 

(8) Ballistic penetration: 

Another method of introduction of nucleic acid 
segments is high velocity ballistic penetration by small 
particles with the nucleic acid either within the matrix of 
25 small beads or particles, or on the surface, Klein, et al. , 
Nature . 327 . 70-73, 1987. 

F. Selection for Integration Events 

30 In some situations, the gene targeting event will itself 

result in a selectable phenotype, in which case the targeted 
cells can be screened directly for homologous recombination. 
For example, disrupting the gene hprt results in resistance to 
6-thioguanine. In many cases, however, the targeting will not 

35 result in such an easily selectable phenotype and, if a low 

efficiency transformation technique such as calcium phosphate 
precipitation is being used, it is preferable to include in the 
targeting DNA construct £ selectable marker such that the 
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stable integration of the targeting DNA construct in the genome 
will lead to a selectable phenotype. For example, if the 
targeting DNA contains a neo gene, then selection for 
integrants can be achieved by selecting cells able to grow on 
G418. 

The relative frequency of targeting to a gene may be 
further improved by using a selectable marker which lacks its 
own promoter, since the likelihood of adequate expression of 
the selectable marker is greater where integration into a gene 
has occurred than for integration into the large parts of the 
genome that are believed to be transcriptionally quiescent. 

G . Isolation of Homologous Recombinants 

Using the present invention, the frequency of 
homologous recombination relative to random integration into 
the genome is substantially improved. The frequency is 
typically improved by a factor of 5 to 10, 50 to 100 or 1000 or 
more depending upon the particular old, targeting sequencers 
and other parameters known by the skilled artisan. In some 
cases, it will be feasible to directly obtain cells in which 
homologous recombination at the target locus has occurred. For 
example, gene targeting may itself result in a readily 
selectable phenotype. Also, selectable markers in the 
targeting DNA can be employed which will be preferentially 
expressed upon integration into the target gene by homologous 
recombination. Another approach is to utilize the polymerase 
chain reaction to screen the cells for homologous recombinants. 
See, e.g., Zimmer, A., et al., Nature, Vol. 338, pp. 150-153 
(1989); and Joyner, A., et al., Nature, Vol. 338, pp. 153-156 
(1989) . 

However, using the present invention, a relatively 
large fraction of the stable integrants will be correctly 
targeted to the gene of interest rather than incorporated at 
random sites throughout the genome. Accordingly, it will be 
feasible to obtain homologous recombinants without the 
necessity of employing any special selection protocols or 
carrying out PCR-based screening. 
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The standard approach for confirming that a cell has 
undergone a homologous recombination event is to isolate 
genomic DNA and perform a Southern hybridization analysis to 
demonstrate that genomic DNA fragments hybridizing with a 
labelled probe of the target DNA have been rearranged because 
of the modification of the target DNA. Southern hybridization 
is described in Sambrook and Maniatis. Given the high 
frequency of homologous recombination obtainable with the 
present invention, the targeted cells can be checked directly 
for homologous recombination. 

H. Targeting Both Alleles of a Targ et Sequence 

Where the cells contain more than one copy of a gene, 
the cell lines obtained from the first round of targeting are 
likely to be heterozygous for the targeted allele. 
Homozygosity, in which both alleles are modified, can be 
achieved in a number of ways. One approach, exemplified below, 
is to grow up a number of cells in which one copy has been 
modified and then to subject these cells to another round of 
targeting using a different selectable marker. Alternatively, 
homozygotes can be obtained by breeding animals heterozygous 
for the modified allele, according to traditional Mendel ian 
genetics . 

In some situations, it may be desirable to have two 
different modified alleles. This can be achieved by successive 
rounds of gene targeting or by breeding heterozygotes, each of 
which carries one of the desired modified alleles. 

I. Production of Genetically Altere d Animals 

Embryonic stem cells which have been modified can be 
injected into the blastocoel of a blastocyst and grown in the 
uterus of a pseudopregnant female. In order to readily detect 
chimeric progeny, the blastocysts can be obtained from a 
different parental line than the embryonic stem cells. For 
example, the blastocysts and embryonic stem cells may be 
derived from parental lines with different hair color or other 
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readily observable phenotype. The resulting chimeric animals 
can be bred in order to obtain non-chimeric animals which have 
received the modified genes through germ-line transmission. 
Techniques for the introduction of embryonic stem cells into 
blastocysts and the resulting generation of chimeric animals 
are well known ( see e.g. , Bradley, A. Production and analysis 
of chimaeric mice , pp. 113-151 in Robertson, E. (ed.), 
Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach , Oxford IRL Press (1987); and Hogan, B. , et al, 
Manipulating the Mouse Embryo , Cold Spring Harbor, New York 
(1986) ) . 

Targeting DNA can also be introduced directly into a 
zygote nucleus using, for example, microinjection. Selectable 
markers and/or other aspects of the present invention can be 
employed and the zygotes can be grown into animals using 
techniques well known in the art. In the case of mammals, the 
targeted organism can be introduced into the uterus of a 
pseudo-pregnant female capable of carrying the developing 
animal to term. 

J. Somatic Gene Therapy 

Similarly, the methods described above can be 
employed for somatic gene therapy to, e.g., alter the 
expression of a gene, or correct a defective gene, or introduce 
a new gene in somatic cells of a human or other animal. The 
somatic cells are first modified, using the methods described 
above, and then introduced into the same or a different 
individual (see Friedman, Science 244 ; 1275-1281) . 

EXAMPLE 1 

Gene Targeting Using Isogenic Targeting Constructs. 

A. Targeting constructs: 

The target DNA selected was the retinoblastoma 
susceptibility gene (Rb) in mouse embryonic stem cells of line 
E14. The targeting DNA consisted of a 10.5 kb Hpal fragment of 
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Rb sequence from around the 19th* and 2 0th exons of the gene 
(see Fig. lb) • 

Two different sources of Rb sequence were used for 
the targeting DNA. The 10.5 kb targeting DNA sequence was 
either isolated from a mouse strain 129-derived DNA library 
("129Rb"), or a BALB/c-derived DNA library ("B/cRb") . Since 
the recipient embryonic stem cells were also derived from mouse 
strain 129 (Hooper, M. , et al., Nature 326:292-295 (1987)), the 
129Rb targeting DNA will be substantially isogenic with the 
target DNA. As discussed below, the B/cRb targeting DNA 
sequence is very similar to the 129Rb sequence but differs by 
about 0.5-1.0% (i.e. one sequence difference per 100-200 
nucleotides) . 

The chosen sequence modification was disruption of 
the Rb gene coding sequence by insertion of the neomycin, 
phosphotransferase (neo) gene into the 19th exon of the gene. 
The neo marker was derived from plasmid pMClneo poly (A) 
(Thomas, K. , and M. Capecchi, Cell 51:503-512 (1987)). A 
mutation present in the neo coding sequence and reducing its 
ability to confer G418 resistance was corrected (see Yenofsky, 
R. , et al. Proc. Nat'l Acad. Sci. USA 87:3435-3439 (1990). The 
neo marker was flanked by 2.5 and 8.0 kb of Rb sequence. The 
resulting constructs, 129Rb- neo and B/cRb-neo are shown in 
Fig. lb. The targeting DNA sequences were separated from 
flanking vector DNA by cleavage with a restriction enzyme 
followed by gel electrophoresis and purification by electro- 
elution. 

B. Gene targeting 

Embryonic stem cell line E14, derived from mouse 
strain 129, was grown on BRL conditioned medium (Hooper, M. , et 
al., Nature 326:292-295 (1987)). Cells (3xl0 7 ) were mixed with 
90 micrograms of targeting DNA (either 129Rb-neo or B/cRb-neo) 
in a volume of 600 microliters of PBS buffer and electroporated 
using a Biorad Gene pulser (0.8 kV, 3 micro F, electrode 
distance 0.4 cm) . Cells were reseeded on 10-cm tissue culture 
dishes at a density of about 10 7 cells per plate. G418 (200 
micrograms/ml) selection was started after one day; after eight 
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days, colonies were randomly picked and grown up for analysis. 
C, Analysis 

Double crossing-over at the Rb-locus will integrate 
5 the neo marker into the 19th exon of the Rb gene, thereby 
disrupting the coding sequence (Fig. la) . G418-resistant 
colonies obtained from both electroporation experiments were 
analyzed by Southern hybridization. DNA from individual G418- 
resistant colonies was analyzed in the following way: l-2xl0 6 

10 cells were embedded in 50 microliters of 05% of low melting 
point agarose in PBS buffer and incubated in 1 ml of EDTA 
(0.5M), Sarcosyl (1%) and Proteinase K (1 mg) for 48 h at 50°C. 
Agarose blocks were washed three times in Tris (lOmM), EDTA 
(lOmM) , pH 8 plus PMSF (0.1 mM) and once in the appropriate 

15 restriction enzyme buffer. DNA digestion took place in 100 

microliters of restriction enzyme buffer containing 50 units of 
restriction enzyme EcoRI for 6 hours at 37 °C. Agarose blocks 
were melted at 65 °C and loaded onto 0.7% agarose gels for 
Southern analysis following standard procedures (see Maniatis 

2 0 or Sambrook) . Using fragment A (Fig. lb) as the hybridization 
probe, the non-modified Rb locus appears as a band of 9.7 kb 
(Rb) ; whereas integration of neo by homologous recombination 
gives a 4 . 9 kb band ( neo ) . 

25 D. Results 

Although the targeting constructs 129Rb-neo and 
B/cRb-neo were identical, except for the origin of the Rb 
sequence, the results obtained with the two constructs were 
different. Using B/cRb-neo, 1 homologous recombinant was 

30 detected amidst 144 random integration events. In contrast, of 
94 analyzed G418-resistant colonies obtained with 129Rb-neo, 33 
underwent homologous recombination at one of the Rb alleles. 
Thus, gene targeting was about 45-fold more efficient with 
129Rb-neo than with B/cRb-neo. The isogenic targeting 

35 construct allowed the easy recovery of homologous recombinants 
(1 out of 3 resistant colonies) without the use of any 
enrichment protocol. 



E. Analysis of the sequence divergence between the 129 and 
BALB/C ta rgeting DNAs 

A comparison of the 12 9 -derived and the BALB/c- 
derived DNAs , ("129Rb" and "B/cRb, 11 respectively) was made to 
confirm that even small amounts of sequence divergence can 
substantially affect the frequency of homologous recombination. 
The 10.5 kb Rb fragments present in targeting constructs 129Rb- 
neo and B/cRb-neo were separated into nine smaller fragments 
(as shown in Fig. 2). Two of these fragments fell into regions 
that were entirely sequenced. The remaining seven fragments 
were separated in a low melting point agarose gel, recovered 
from the gel, and digested with Hinfl, Taal. or both, using 
standard techniques (see e.g., Sambrook, supra). The digested 
fragments were radioactively labelled and analyzed in a 
sequencing gel (see, id.). 

The restriction digestion patterns of the two 
fragments were identical for 8 out of the 13 enzymes tested, 
showing that no gross alterations had occurred. Five 
restriction site polymorphisms were seen, suggesting that 5 
base pair substitutions were present within the 275 basepairs 
(bp) analyzed in this way. Second, 1102 nucleotides around the 
site where the neo marker was inserted and 585 nucleotides 5 kb 
away from this site were sequenced. Within these two regions 
(containing 1687 nucleotides) nine basepair substitutions, 
three small deletions (1, 4 and 6 nucleotides) and a 
polymorphic CA-repeat (a 14 bp deletion) were detected in the 
BALB/c sequence with respect to the 129 sequence. The longest 
stretch of perfect homology within the sequenced region was 278 
nt. Finally, to detect deletions/ insertions in the remainder 
of the targeting constructs, the 10.5 kb ^ fragments were 
digested into 9 smaller fragments (see Fig. 2) . Two of these 
fragments fell in the region already sequenced, the remaining 
seven were further digested with restriction enzymes, 
radioactively labelled and analyzed on a sequencing gel. By 
this analysis, 3 deletions (2, 2 and 5 nucleotides) and three 
small insertions (1, 2 and 10 nucleotides) were detected in the 
BALB/c fragment with respect to the 129 fragment. A summary of 
the sequence and restriction fragment length analyses is given 
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in Fig, 2. Based on these results, we estimate that on the 
average one sequence difference (a base pair substitution or a 
deletion/ insertion) was present per 160 nucleotides, for an 
overall sequence divergence in the range of about 0. 5-1.0%. 
5 Thus, even though the two targeting constructs shared an 
average sequence identity of about 99%, they nevertheless 
exhibited a significant difference in their efficiency as gene 
targeting substrates. 
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EXAMPLE 2 

Successive targeting using two different selectable markers, 

targeting a selectable marker 

15 A, First round of gene targeting 

The targeting construct contained a selectable 
marker, an hprt minigene, embedded in 17 kb of targeting DNA 
from the retinoblastoma susceptibility gene derived from mouse 
line 129 (see Fig. lc) . The cells to be targeted were the 

20 mouse embryonic stem cell line E14Tg2a, an HPRT-minus 

derivative of cell line E14 (which was derived from cell line 
129; see Hooper, M. , et al., Nature 326:292-295 (1987)). Cells 
were electroporated with targeting DNA as described in Example 
2. 

25 Integration of the h prt minigene into the ES cell 

genome results in the acquisition of the ability to grow on HAT 
medium. Of 35 tested colonies that were selected on HAT 
medium, 8 contained the hprt -minigene correctly integrated into 
the 19th exon of one of the Rb alleles via homologous 

30 recombination. None of the homologous recombinants contained 

additional hprt copies integrated elsewhere in the genome. One 
of these clones, designated HAT-20, was used as the recipient 
for a second targeting experiment. 

3 5 B. Second round of crene targeting 

Clone HAT-20 was subjected to gene targeting using 
the constructs 129Rb-neo and B/cRb-neo (described above in 
Example I) . HAT-20 cells were electroporated with 90 



micrograms of targeting constructs B/cRb-neo and 129Rb- neo and 
the linearized vector pMClneo poly (A) . G418 R colonies were 
scored after 8 days; 6-Thioguanine (10 jig/ml) was added and 
surviving colonies were counted 8 days later. From each 
electroporation experiment individual colonies were picked and 
grown up for DNA analysis. Double crossing-over at the 
previously targeted Rb allele will substitute hurt for neo . 
giving colonies resistant to both G418 (neo+) and 6-Thioguanine 
(Hprt-) . The ratio of homologous recombinants (resistant to 
both 6-TG and G418) to the total number of integrations (G418 R ) 
was much higher with 129Rb- neo than with B/cRb-neo (see Table 
1) . Some 6-TG-resistant colonies were also seen after 
electroporation of HAT-20 with the plasmid pMClneo poly (A), 
albeit at a much lower rate than with either targeting 
construct (Table 1) . 

DNA of individual clones ( 6— TG R and G418 R ) was 
digested with PstI and analyzed by Southern hybridization. 
Using fragment A (Fig. lb) as a probe, bands of the expected 
size appeared, corresponding to the wild type Rb allele (4.9 
kb) , the Rb allele containing hprt (7.7 kb) and the Rb allele 
containing neo (3.9 kb) . Colonies resistant to both 6-TG and 
G418, obtained upon electroporation of HAT-20 with B/cRb-neo 
(a), 129Rb-neo (b) and pMClneo poly(A) (c) were analyzed as 
described in Example I. 

DNA analysis of 18 colonies obtained with 129Rb- neo 
confirmed that all 18 resulted from homologous recombination 
with the target allele. In contrast, analysis of the colonies 
obtained using the . B/cRb-neo construct demonstrated that 14 out 
of 29 colonies resistant to 6-TG resulted from the spontaneous 
loss of the hort containing allele rather than from homologous 
recombination. Analysis of colonies obtained using pMClneo- 
poly(A) revealed that they had all lost the hprt containing Rb 
allele, possible by loss of the entire chromosome. Corrected 
for the spontaneous loss of the h prt minigene in the HAT-20 ES 
cell line, the frequency of homologous recombination was 1/200 
for the B/cRb construct, but reached 1/10 using the isogenic 
targeting construct (129Rb) . In summary, using isogenic DNA 
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resulted in a 20-fold increase in the efficiency of gene 
targeting by homologous recombination. 
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Table 1 Efficiency of homologous recombination 



10 



15 



20 



25 



DNA 

B/cRb-neo 
129Rb-neo 
pMClneo p(A) 



Number 
of cells 
(HAT-20) 

5X10 7 

5X10 7 

2.5X10 7 



G418 K 
(total) 

11500 

13500 

5470 



G418 R & 6-TG R 
(HR) 

105 

1260 

11 



Efficiency * 
(HR/ total) 

1/200 

1/10 



* Corrected number of homologous recombinants (HR) divided by 
total number of G418 R colonies obtained. DNA analysis revealed 
that, in the case of B/cRb-neo, about half of the 6-TG R 
colonies resulted from spontaneous loss of the hort allele 
rather than homologous recombination. The same was true for 
all of the colonies resulting from pMClneo poly (A) . For the 
isogenic construct (129Rb-neo) , all of the colonies examined 
resulted from homologous recombination* 
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EXAMPLE 3 

Targeting both alleles of a gene; and a comparison of 
positive/negative selection and isogenic targeting. 

5 In the first step, the retinoblastoma (Rb) allele of 

mouse embryonic stem cell line E14 was disrupted by homologous 
recombination with a BALB/c-derived targeting construct 
employing a standard positive/negative selection strategy as 
described by Capecchi and co-workers (see Mansour, S., et al, 

10 Nature 336:348-352 (1988), and using approximately 18 kb of Rb 
targeting sequence , three correct integrations of a neo marker 
into the 19th exon of the Rb gene were isolated from a 
background of 3 600 random integration events. 

One of these single Rb knock-out cell lines was used 

15 as the recipient in a second electroporation experiment with an 
isogenic targeting construct, consisting of a hygromycin 
resistance gene ( hvcr ) embedded in 17 kb of a 129— derived Rb 
sequence (see 129Rb-hyg, Fig* Id) . Electroporation conditions 
and DNA analysis were similar as described in the legend to 

20 Fig. 1. In a typical experiment, 8xl0 7 cells were 

electroporated with 90 micrograms of 129Rb- hva DNA (Fig. Id) . 
Hygromycin B (150 micrograms/ml) selection was started after 
one day. Approximately 15,000 hygromycin B resistant colonies 
were obtained and, after 12 days of growth, a number of 

25 individual colonies were picked and grown up for DNA analysis. 
DNA derived from 61 different Hygromycin B-resistant colonies 
was digested with EcoRI and analyzed by Southern hybridization. 
Using fragment B (Fig. Id) as a probe, different sized bands, 
corresponding to the non-modified Rb allele (9.7 kb) , the Rb 

30 allele with neo integrated (11.5 kb) and the Rb allele with hva 
integrated (4.9 kb) , could be observed. The Southern analysis 
revealed that approximately 75% of the Hygromycin B-resistant 
colonies tested (48 out of 61) resulted from homologous 
recombination. Thus, not only were homologous recombinants 

35 readily obtainable, they were the predominant type of cell . 
arising from integration of the targeting DNA. Furthermore, 
all 48 of these lines had undergone homologous recombination at 
the Rb locus. In 40 of the lines, the hyq gene was correctly 
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integrated into the remaining wild-type copy of the Rb gene 
thus giving cell lines in which both Rb alleles had been 
disrupted. In the other 8 lines r the hyg targeting DNA had 
incorporated by homologous recombination but the target had 
5 been the already modified allele in which the hyg; targeting 
sequence replaced neo. By selecting the recombinants on both 
G418 and hygromycin, it is possible to select against cells in 
which the second targeting DNA has merely replaced the first. 
The results also exemplify the effect of using 

10 isogenic targeting. With a fairly homologous targeting DNA, 

and employing a positive/ negative selection strategy, less than 
0.1% of the cells (approximately 1/1200) were homologous 
recombinants. In contrast, using isogenic targeting DNA, about 
75% of the cells were correctly targeted without having to 

15 employ special selection techniques. 

The invention described and claimed herein is not to 
be limited in scope by the specific embodiments herein 
disclosed, since these embodiments are intended as 
illustrations of several aspects of the invention. Any 

2 0 equivalent embodiments are intended to be within the scope of 
this invention. Indeed various modifications of the invention 
in addition to those shown and described herein will become 
apparent to those skilled in the art from the foregoing 
description. Such modifications are also intended to fall 

25 within the scope of the appended claims. 



